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Abstract 

A new parameter choice rule for inverse problems is introduced. This pa- 
rameter choice rule was developed for total variation regularization in electron 
tomography and might in general be useful for L 1 regularization of inverse prob- 
lems with high levels of noise in the data. 

1 Introduction 

A standard procedure when solving an ill-posed inverse problem Tf = g, where T : 
X — > Y is an operator which lacks a continuous inverse, is to define a class of continuous 
operators 5a : Y — > X depending on a parameter A and approximating an inverse of 
T as A — * 0. An approximate solution of the inverse problem is then given by S\(g) 
where the regularization parameter A must be selected by some parameter choice rule. 
Numerous parameter choice rules have been proposed, which are suitable for different 
inverse problems. For a nice survey, see Chapter 4]. 

The parameter choice rule proposed in this paper is inspired by the application 
of L x -type regularization methods, particularly total variation (TV) regularization, to 
the inverse problem in electron tomography (ET) . For a survey of total variation image 
restoration methods, see p], and for a comprehensive account of the mathematics in 
ET, see [3J. It turns out that the well-known parameter choice rules are difficult to 
apply to this problem. One reason for this is that the regularized inverses S\ are 
non-linear even if the forward operator T is linear. A second reason is that this inverse 
problem has unususally high noise level in the data, the norm of the noise component 
by far exceeding the norm of the signal. For this reason, in order to recover anything 
at all, it is necessary to make use of knowledge about the statistical properties of the 
noise, not just its magnitude. 

These issues are taken into account by the proposed parameter choice rule. It is 
specifically designed for regularization functionals of L 1 type, and for inverse problems 
aiming at reconstructing sparse objects. As it is applied here to ET data, the method 
is a heuristic, or error free, parameter choice rule in the terminology of [2 . By this 
one understands a parameter choice rule which does not require an explicit estimate 
of the noise level to be made. Instead, the noise level is estimated directly from the 
data, relying on certain assumptions about the nature of the noise. 

It turns out that the method developed for ET can be broken down in several 
components, which can probably be applied independently in various other inverse 
problems. Here an attempt is made to present these components independently. The 
paper is organized as follows. In Section|2] the most general part of the method, requir- 
ing the least structure of the inverse problem, is described. In Section[3jthe framework 
is applied to total variation regularization. Finally, in Section [4] the application to ET 



is developed, and numerical examples are presented. A general discussion is given in 
Section [5] 



2 Basic principle 

Consider the following inverse problem. Let T : X — » Y be a linear operator between 
two linear spaces, with Y a Hilbert space. Let / true £ X be an unknown element, which 
it is our goal to estimate. What is known is an element g data = xf tlue + g nolsc £ Y 
where g nolS0 is a sample of a random vector G nolse . Let us assume that E[G nolS0 ] = 0. 
However, the probability distribution of G nolsc may not be completely known, and 
might to some extent depend on J true . 

As a regularized inverse of T we consider a class of (non-linear) operators S x : Y — » 
X depending on a regularization parameter A and defined by 

S x (g) := argmin /6X R x (f) + \\\Tf - g\\ 2 (1) 

where R\ : X — > R is a regularization functional parameterized by A. The idea is 
that the reconstruction / rcc = S x (g a ) might be a good approximation to / truc for 
suitable choice of the regularization parameter. 

Let us assume the existence of a unique minimizer of the optimization problem 
in 0. Let us also assume that R\ is convex, and that 

R x (af) = \a\R x (f), Vaet. (2) 

The problem considered here is how to choose the regularization parameter A. 
Various methods for choosing regularization parameters are of course known. The 
method proposed here is motivated by observing the solution of optimization problems 
similar to 0, but where / is restricted to vary along a line in X. Explicitly, if / € X 
and jeF, define 

a\(f,g) ■= argmin aeR i? A (a/) + ^\\T(af) -g\\ 2 . (3) 

Contrary to (JTJ, the solution of ^ can be computed explicitly. 

Lemma 1. Let f G X and g £ Y, and suppose that R x satisfies |2j. IfTf^O 
R\(f) > then d3J) has a unique solution given by 



or 



(Tf,g)-R x (f) 



(Tf,g)>R x (f) 



a X (f,g) = < 



ll?7|| 2 

0, \{Tf,g)\<Rx{f) (4) 



(Tf,g) + R x (f) 



(Tf,g) <-R X (f). 



\\Tf\\ 2 

If on the other hand Tf — and R X (f) = 0, the solution is not unique. 
The proof is a straightforward computation. 

Now we want to look at the random variable a x (f, G nolsc ). The intuitive idea 
is that if a x (f,G nmsc ) is close to with high probability, this is an indication that 
S x (g data ) is not heavily influenced by noise. This intuition may or may not be correct, 
depending on the nature of the regularization functional and the forward operator. 
The precise conditions needed for the idea to be valid are not yet clear. 
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Let us for any / G X define 



°{f) '■= Var K I 7) G noiBe )] 1/2 (5) 

and 

sx(f) ■= (6) 

Lemma [l] indicates that if s\(f) ^> 1, then a\(f, G nolsc ) = with high probability. 
On the other hand, if s\(f) <C 1, then ct\(f, G nolsc ) is not strongly affected by the 
regularization functional. 

The conclusion is that the values of s\(f) for different f E X might serve as 
quantitative measures of the strength of the regularization as compared to the noise 
level. This leads us up to the formulation of the basic form of the proposed parameter 
choice rule: 

1 . Choose a finite set F of elements in X. 

2. For each / G F, choose s m in(/) € R- The choice of s m i n (/) determines how 
strongly the inverse problem is regularized. As a rule of thumb, s min (f) > 5 
corresponds to very strong regularization, while s m - m (f) < 1 corresponds to 
weak regularization. 

3. Choose the smallest regularization parameter A such that s\(f) > s m in(/) for 
all / G F. 

In Section [3] a suitable choice of the set F and a model for computing s m i n (/) is 
given for the special case of TV regularization. In Section [3] a method for estimating 
<r(/) for ET data is provided. With these ingredients we will then be ready to apply 
the parameter choice rule in TV regularized ET. 



3 Application to TV regularization 

Let us specialize the setting as follows. Let f2 C M" be a bounded open set and let 
X be the space of functions of bounded variation with support in f2, with the total 
variation norm 

||/||tv:=su P jjT fV-hdx:heC%(R n ,R n ), \h(x)\<l\ (7) 

where Cq(]R™,]R™) denotes the space of continuously differentiable functions from R n 
to R" with compact support and V • h is the divergence of h. For continuously differ- 
entiable functions / (among others) the total variation is given by 

II/IItv= / \Vf{x)\dx. (8) 

However, the space X contains many non-differentiable functions, including for exam- 
ple the characteristic functions of certain sets, known as Caccioppoli sets. Caccioppoli 
sets include all sets with C 2 boundary, see [1] for details. 

In TV regularization, the regularization functional is chosen to be R\(f) ■= A||/||tv- 
Also, let us assume that cr(/) is translation invariant, meaning that if /i,/2 G X 
are related by fi(x) = fa(x — x ) for some x G E n , then cr(/ x ) = cr(/ 2 ). Since 
||/i||tv = II/2IITV, it then follows from the definition of s\ that sa(/i) = sa(/2)- 
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3.1 Choice of the set F 



Here we must choose a finite subset F c X which is somehow representative of all 
functions in X. Lemma [2] below suggests that it is reasonable to restrict attention to 
characteristic functions of Caccioppoli subsets of il. 

Suppose that E c fl is a Caccioppoli set, and that E = E\ U E 2 with E\ n E% =0. 
Let fi and fi be the characteristic functions of Ei and E2, so / := fx + f% is the 
characteristic function of E. Then <r(/) < cr(fi) + cr(f2) and ||/||tv = ||/i ||tv + II/2IITV 
and it follows that 

ff s ^ JI/i||tv + II/2||tv «a(/i)o"(/i) + s A (/2)o-(/ 2 ) . 

Hence, if fi and fi are both included in F, there is no need to include / unless 
•Smin(Z) > min{s m i n (/i), s m i n (/2)}. From this observation it would be tempting to 
conclude that only characteristic functions of connected Caccioppoli sets need to be 
included in F. However, this is not strictly true, since there are very complicated Cac- 
cioppoli sets, which for example can have uncountably many connected components. 
Nevertheless, of all functions that are likely to be treated numerically in practice, 
only characteristic functions of connected sets need to be included in F. By similar 
reasoning, if 17 has connected complement, only characteristic functions of sets with 
connected complement need to be included in F. 

I further suggest that in many cases it should be reasonable to choose F as a set of 
characteristic functions of balls of different sizes. The precise arguments for this and 
the conditions under which they are valid remain to be clarified. 

Suppose we make the choice to let F consist of characteristic functions of balls. 
We would then choose a finite set D — {di, . . . , d^} of diameters of these balls. By the 
assumption of translation invariance, it is sufficient to include one ball of each diameter 
in F, so we have F — {fd ■ d <E D} where fd denotes the characteristic function of an 
arbitrary ball of diameter d. 

To conclude this section, we state and prove the lemma which was used above to 
motivate the restriction to characteristic functions of Caccioppoli sets. 

Lemma 2. Suppose there exists a constant C such that o~(f) < C||/||tv an d a {f) ^ 
Cli/lli 00 f or a M f € X. If s\(f) > s > for every f £ X which is the characteristic 
function of a Caccioppoli subset of£l, then the same inequality holds for every f € X. 

Remark 1. From the hypothesis of the lemma it is trivially true that s\(f) > X/C. 
However, the constant C could a priori be very large. The point of the lemma is that 
if a better estimate holds for characteristic functions of sets, then the same estimate 
necessarily holds for all functions of bounded variation. 

Proof. The idea of the proof is simply that an arbitrary function is a superposition of 
characteristic functions defined by its level sets. 

Note first that it is sufficient to prove the statement for positive functions. For 
assuming that this has been done, an arbitrary function can be written as / = /+ — /_ 
where /+ and /_ are positive and ||/||tv = ||/+||tv + ||/-||tv- Since a(f) < a(f+) + 
<r(/_) it follows that 

m> JI/+IItv + ||/-||tv sa(/+M/+) + sa(/-M/-) 

So suppose that / is a positive function. For t > define Xt to be the characteristic 
function of the set {x : f(x) > t}. By the coarea formula [11 Theorem 1.23], it holds 
that 

/>oo 

II/||tv= / IMlTvd*. (9) 
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If it can be shown that 

v(f) < / <r(Xt)dt (10) 
Jo 

the conclusion follows, since then 

ff\ ^ A io°° Wxthvdt J Q °° s x {xt)cr(xt)dt 

s\U) > — p>c— : — r-7— - 756— — r-77 > s o- 

Jo <r{Xt)dt J a{xt)dt 

Let = to < ti < i 2 < • • • < t m be a sequence of positive numbers, and define 
8i = ij — 1. For each i = 1, . . . , m there is some 6 [ii-ij ij] such that 



ti-i 



If 



then it holds that 



<r(/l) < £<WXr t ) < / ff(xt)dt. (11) 

If we define / 2 (x) — max{0, f(x) — t m }, it follows from the coarea formula and the 
assumptions of the lemma that 



<K/ 2 ) < C||/ 2 || TV = C / MTV d« 



Finally, we have that 

<7(/ - /1 - /a) < - /alii- < C max 5,. 

l<z<m 

This shows that cr(/ 2 ) and <r(/ — fx — /a)) can b e made arbitrarily small by making 
m and t m large. Since a(f) < cr(/i) + c(/2) + c(/ — fi — h)), ( 10 1 follows from ( 11 ) 
and this completes the proof. □ 



3.2 Model for computing s min (/) 

Here I will provide a model for computing s m i n (/). Let /j £ F be a function whose 
support is a ball of diameter d, for example a characteristic function as suggested in 
the previous section. Let us assume that we are given a real number a > and want 
to avoid that |/ rcc (a;)| > a in regions where / truc (x) = 0. Let us also assume that the 
probability distribution of (Tf, G nolsc ) is to a good approximation Gaussian. 

Consider balls of diameter d in £1. The maximum number of such disjoint 
balls in Q is approximately Nd ~ \ fl\d~ n . Let f\, . . . , fg d be translations of fd with 
support in these disjoint balls. A heuristic argument suggests that we should look at 
the probability that \a x {f d , G noiso )| • > a. By Lemma[l] that probability is 

(assuming that (Tf d ,G aoisc ) is Gaussian) 
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Let us, rather arbitrarily, choose the regularization parameter so that the expected 
number of j G {1, . . . , Nj} with \a\(f^,G nolse )\ > a is not more than 1. This is 
equivalent to the inequality 



s\(fd) > s min (f d ) = v^erfc 1 



1 \ a\\Tf d 



2 



N d J *(f d )\\f d \\ L - (13) 



The formula is easily modified if some other restriction on the expected number of j 
fi 



with |a A (/j,G noisc )| > a is desired. 



4 A numerical example from electron tomography 

In this section I will apply the proposed parameter choice method to a numerical 
example from electron tomography (ET). First I give a brief description of this inverse 
problem. For details, see for example [3]. 

Electron tomography is a method using a transmission electron microscope to con- 
struct three-dimensional models of biological macromolecules and similar structures. 
The data collected consists of a series of images, a tilt series, with the specimen 
tilted in different angles. Hence the data space can be decomposed as a direct sum 
Y = Yi © ■ • • © Y m , where Yj corresponds to the jth image in the tilt series, and each 
vector g £ Y can be written as g = g\ + • — |- g m where gj G Yj . The forward operator 
T has a corresponding decomposition into components Tj : X — > Yj. 

A commonly used approximation of the forward operator is that each Tj consists 
of a parallel beam transform in a direction depending on j, followed by convolution 
with a point spread function. The noise in the data comes mainly from the stochastic 
nature of the detection of the electrons, and has a Poisson distribution. Due to the 
necessity of using a very low electron dose in each image, the noise level is usually very 
high. 

In this model, the assumption of translation invariance made in Section [3] is valid. 
Since (Tf, G nolsc ) is composed of noise from all the images, which can reasonably 
be assumed to be independent, the assumption that its probability distribution is 
Gaussian seems plausible according to the central limit theorem. 

4.1 Estimation of cr(f) 

In order to apply the proposed parameter choice method, it is necessary to estimate 
<t(/) for a given / G X. For tomographic data of the type encountered in ET, this 
estimate can be made directly from the data set, given the following very reasonable 
assumptions. 

1. The noise components in separate images are uncorrelated. 

2. The noise components in different parts of the same image are at most weakly 
correlated. 

3. The probability distribution of (Tj f G^ OIse ) is invariant under translation of /. 

4. In each image, the signal to noise ratio is much lower than 1. 
Now, by assumption [l] we have that 

m 

Var[(T/, G" oisc >] = ]T Var^/, Gf»>)]. (14) 

3=1 
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To estimate the right hand side of this equality, take a number of random translations 
/i, . . . , fi of / (this requires that the support of / is small compared to f2). I claim 
that Var[(7j/, G" OIse )] can be approximated by a sample variance 



VarKT^Gf 86 )] « y^y E f (^^9?°™) - 7 E r ^'>Si° iS< ) 

i=l V i'=l / 



(15) 



This is justified by assumption s [2| and [3] Finally, assumption [4] justifies that we 
can replace g nolse by g data in (15 1. Combining these steps leads to the following 
approximation: 



m . I / , I \- 

°V) 2 " E |3i E (^/-O - y E^^''0 

j=l i=l V i' = l / 



(16) 



4.2 Numerical results 

The numerical results presented in this section were obtained by approximately solving 
the minimization problem (|T|) with T the forward operator from electron tomography 
described above and g a real or simulated data set. The regularization functional is 
an approximation of the total variation norm, defined as follows. Let Xj^fc, (i,j, k) £ 
I C H? be a rectangular lattice of points at which the function / is sampled. For all 
{hji k) £ I we take f(xij.k) to be 0. Let I be the subset of Z 3 consisting of / together 
with all points adjacent to /. Let 



D tf( x i,0,k) 
D£f(Xi,j,k) 



f(Xi+l,j,k) ~ f{%i,j,h) 
/(Xij+l.fe) - f{Xi,j,k) 



be discrete partial derivatives of / in the forward direction, and similarly let 



D 2f( X i,j,k) 
D 3fi x i,j,k) 



f(Xi,j,k) - f{Xi-X,j,k) 

f( x i,j,k) - f( x i,j-l,k) 

f{ x i,j,k) - f{Xi,j,k-l) 



be discrete partial derivatives in the backward direction for all (i,j,k) € /. Now we 
define 



1/2 



(17) 



The parameter (3 is included in order to make the regularization functional smooth, 
which was necessary for the minimization algorithm used. (For alternative optimiza- 
tion algorithms which do not require this approximation, sec for example |5j.) It was 
set to a small positive value, (3 = 3 • 10~ 4 . This is well below the level where changes 
in (3 do not seem to have any noticeable effect on the solution. However, in the appli- 
cation of the parameter choice rule, (3 was set to 0, so that the condition ^ is exactly 
satisfied. 

The approximate solution of the minimization problem (JlJ was computed by it- 
eratively searching for the minimum in 2-dimensional subspaces of X, where each 
subspace is spanned by the gradient of the objective functional and a vector in the 
direction of the previous update. This method seems to be considerably faster than a 
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gradient descent method minimizing over a 1-dimensional subspace in each iteration. 
The iteration was continued until no appreciable change occured even after many iter- 
ations. The number of iterations used was in most cases between 500 and 1500, with 
a fairly good approximation of the end result occuring within 100 iterations. 

4.2.1 Simulated data set 

The first numerical example is reconstructed from a simulated data set. The phantom 
used in the simulation contains 30 Y-shaped objects of varying size and contrast. 
From this phantom an ET data set was simulated, consisting of 121 projections. The 
specimen was tilted about a single axis, with the tilt angle ranging from —60° to 60°. 
The simulated electron dose was 15.7 electrons per pixel on average over the tilt series. 
A section through the phantom and the central projection in the data set are shown 
in figure [l] 




Figure 1: Left: A section through the phantom. Right: The central projection in the 
data set. 

Next the parameter choice rule from the previous sections was applied to the data 
set. Note that the only input needed is the data set, a model for the forward operator 
and the threshold parameter a. A reasonable choice for the threshold parameter, 
given the overall levels of contrast in the phantom, seems to be a = 0.5. With this 
choice we should hope that the reconstructed objects are well above the noise level, 
even if the contrast is somewhat reduced by the regularization. Figure [2] shows the 
dependence of A on a, and the dependence of s m i n (fd) and sx(fd) on fd for the choice 
of A corresponding to a = 0.5. The diameter d is measured in voxel units. 

If the proposed parameter choice rule is applicable, A w 30 should be a suitable 
choice of the regularization parameter. To test if this is the case, a series of TV regu- 
larized reconstructions were computed with the regularization parameter ranging from 
12 to 48. The size of the reconstructions is 200 x 200 x 100 voxels. A section through 
each of the reconstructions is shown in figures [3j^7] Which one of these reconstructions 
would be considered optimal is of course strongly dependent on the type of further 
analysis it is intended for. 

To further investigate the amount of undesirable noise in the reconstructions, the 
following analysis was applied. For a given threshold a > and a given reconstruction 
/ rcc , the set {x e ft : f rcc (x) > a} was computed and decomposed into its connected 
components. (In the discrete setting, a voxel was considered to be connected to each 
of its 8 nearest neighbors.) A connected component was classified as a true hit if it has 
nonempty intersection with some object in the phantom, otherwise it was classified as 
a false hit. Hence, the number of true and false hits can be counted, where the count 
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Figure 2: Left: The solid line shows the dependence of the regularization parameter A 
on the threshold parameter a when it is chosen according to the proposed parameter 
choice rule. The graph is composed of line segments since a discrete set of diameters 
d were used in the computation. This graph is compared to the ideal parameter 
choice rule for this problem (stars), which can be determined by comparing a set 
of reconstructions with different regularization parameters to the true solution (see 
below). Right: The dependence of s m in(/d) (circles) and s\(fd) (stars) on d for a = 0.5 
and the corresponding A = 31.4. For d greater than approximately 6, s m i n (/d) drops 
below 0, and does not impose any restriciton on A. 




Figure 3: Left: Section through reconstruction with A — 12. Right: Section through 




Figure 4: Left: Section through reconstruction with A = 20. Right: Section through 
reconstruction with A = 24. 
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Figure 5: Left: Section through reconstruction with A = 28. Right: Section through 
reconstruction with A = 32. These are in the range obtained by applying the proposed 
parameter selection rule, depending on the threshold parameter. 




Figure 6: Left: Section through reconstruction with A = 36. Right: Section through 
reconstruction with A = 40. 




Figure 7: Left: Section through reconstruction with A = 44. Right: Section through 
reconstruction with A = 48. 
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depends both on the regularization parameter A and the threshold a. Only one true 
hit was counted for each of the objects in the phantom, so the number of true hits can 
never exceed 30. Table [l] shows how the number of true and false hits depend on the 
regularization parameter when a = 0.5. For A below a certain level, the number of 
false hits increases dramatically, and there is an obvious risk of misinterpretation. In 
this case, the proposed parameter choice rule makes a surprisingly accurate prediction 
of the point where false hits start to occur. 



A 


12 


16 


20 


24 


28 


32 


36 


40 


44 


48 


True hits 


25 


24 


22 


19 


19 


16 


14 


13 


13 


12 


False hits 


471 


98 


28 


5 


1 





1 












Table 1: The number of true and false hits at threshold level 0.5 as a function of the 
regularization parameter A. 

Another way to look at false hits is as follows. In a given reconstruction we can 
compute the smallest threshold a which does not give rise to any false hits. This defines 
a relation between a and A which can be considered as an ideal parameter choice rule. 
The only obvious way to determine this ideal parameter choice rule is to compute a 
set of reconstructions with different regularization parameters and compare them to 
the true solution / true , in order to classify reconstructed objects as true or false hits. 
This, of course, impossible in real life problems, primarily because / truc is not known. 
However, with simulated data, the ideal parameter choice rule can be compared to a 
practically applicable parameter choice rule. 

Such a comparison is shown in figure [l] above. The plot shows that, while there is 
certainly a discrepancy, the proposed parameter choice rule provides at least a rough 
approximation of the ideal parameter choice rule for this particular problem. The 
regularization parameter selected by the proposed rule tends to be too small when a 
is small and too large when a is large. The reason for this trend is not yet clear. 



4.2.2 Real data set of TMV specimen 

The second example uses a real ET data set of a specimen containing Tobacco Mosaic 
Virus (TMV). The TMV is a long and fairly rigid cylindrical object, approximately 
18 nm in diameter. The tilt series contains 61 projections, with the tilt angle varying 
in the range —60° to 60°. The electron dose was 64.5 electrons per pixel on average 
over the tilt series. 

Application of the parameter choice rule to the data set yields the dependence of 
the regularization parameter A on the threshold a shown in figure [8] This suggests 
that A w 70 should be a suitable choice. 

Sections through reconstructions with a range of regularization parameters are 
shown in figures [9 -11 The reconstructed volume contains two TMV's, which are 



clearly visible in the images. 

When working with real data it is of course not possible to know for certain which 
features in a reconstruction correspond to real objects in the specimen. In this case, 
all that we can be reasonably sure about is that the large elongated objects in the 
reconstructions indicate the presence of TMV's in the specimen. Smaller objects might 
be due to noise, but could also represent some contamination in the specimen. Table [2] 
lists the number of small objects above the threshold level 0.5 in the reconstructed 
volume for different values of the regularization parameter. 

The number of small objects in the reconstruction grows rapidly when the value of 
the regularization parameter goes below the value provided by the parameter choice 
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Figure 8: Left: The dependence of the rcgularization parameter A on the threshold 
parameter a when it is chosen according to the proposed parameter choice rule. Right: 
The dependence of s m i n (fd) (circles) and s\(fd) (stars) on d for a = 0.5 and the 
corresponding A = 73.5. 




Figure 9: Left: The central projection in the data set. Right: Section through recon- 
struction with A = 40. 





Figure 11: Left: Section through reconstruction with A = 70. Right: Section through 
reconstruction with A = 80. 



Regularization parameter A 


40 50 60 70 80 


Number of small objects 


129 40 17 5 1 



Table 2: The number of small objects, not classified as TMV's, at threshold level 
a = 0.5 as a function of the regularization parameter A. 



rule. Although it is impossible to know for sure if these small objects are due only 
to noise, the conclusion must be that this might very well be the case. Hence, no 
significance should be attached to these objects when the reconstruction is interpreted. 



5 Discussion 

The numerical examples presented suggest that the parameter choice rule outlined 
in this paper might be useful for inverse problems of the type encountered in ET. 
To further investigate under what circumstances the method would be useful, it is 
indispensable to gain a better understanding of the underlying mathematics. Once 
this has been done it should be possible to devise a pertinent set of numerical test 
cases. 

It might certainly be argued that in these examples, some interpretations of details 
could be more easily made from the reconstructions with a smaller regularization 
parameter than the one indicated by the proposed parameter choice rule. This suggests 



that perhaps (13 1 should be modified so that the computed value of s m i n is somewhat 
smaller. This could be done by relaxing the restriction on the expected number of 
j with \tt\(f 3 d , G nolsc )| > a, which was in the derivation above, rather arbitrarily, 
chosen as 1. However, if s m - m is decreased, both the heuristic argument and the 
numerical examples indicate that one should expect to have false objects appearing in 
the reconstructions, and the interpretations must be made with this in mind. 

In the application to ET data exemplified in this paper, the proposed parameter 
choice rule is heuristic, or error free, the magnitude of the noise being estimated directly 
from the data. It is well known that such heuristic parameter choice rules, can not 
provide convergent regularization methods in the strict sense, that is, methods that 
converge to the true solution when the noise level approaches zero. Such heuristic 
parameter choice rules might nevertheless be very useful in practice, when the noise 
level certainly does not approach zero. As mentioned in the introduction, one of the 
motivations for this new parameter choice rule is the difficulties associated with very 
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high noise levels. A variant of the proposed parameter choice rule would be to estimate 
the quantities cr(f) from some a priori estimate of the noise level. Used in this way, 
the parameter choice could provide a convergent rcgularization method. 

In the derivations leading up to the proposed parameter choice rule, particularly 
in the estimation of u(/), rather restrictive, albeit very realistic, assumptions were 
made on the statistical properties of the noise. It is worth noting that the necessity of 
making such assumptions, explicitly or implicitly, is inherent to the inverse problem in 
ET; without them no reconstruction would be possible at all. A malicious demon, if 
allowed to choose the noise with only a restriction on its norm, could easily hide every 
trace of the signal we are trying to recover. 

One advantage of the suggested parameter choice rule, as compared for example 
to the L-curve method, is its computational efficiency. In the numerical examples 
considered here, the application of the parameter choice rule has a lower computational 
cost than one single iteration of the algorithm subsequently used to compute the 
regularized solution. 

The considerations leading up to the parameter choice rule can also be employed 
in an alternative way. Suppose a rcgularization parameter has been chosen by some 
other method and a reconstruction has been computed. In the interpretation of the 
reconstruction it is then desirable to know which features are reliable, and which are 
likely to be an effect of noise. A computation of the quantity s\(f), where / is a 
certain feature in the reconstruction, can then be used to estimate how likely it is for 
such features to occur solely as a consequence of random noise. 
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